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Positions that can be changed without affecting enzyme activity (demonstra ted) 
positions -1 and -2 are not natural. Ihe two amino acids are added due to 

cloning strategies 

positions 1 to 10: can be deleted 
position 36: G Is tolerated 
Dosition i»0: R or V are tolerated 
position ^1: S or N are tolerated 
position A3: 7i Is tolerated 
position US: V or N are tolerated 
position 91: I is tolerated 
positions 123 and 156: L are tolerated 
position 223: A and S are tolerated 

Changes that affect enzyme activity (demonstrated) 

position 19: L to b 

position 38: T to S or N 

position 39: ^ to U or E 

position 40: I to Q 

position ij2: r to 1? 

position tiH: D to E 6 or H 

position US: 7f to E or D 

position 16: Y to U 

position H7: T to 5 or N 

position 80: E to 1 

position lUk: D to E P I r:: 

position 115: U to E r I O. 

position 116: G to E 
position 117: 5 to S 



Group I Intron Encoded Endonucleases and Related Endonucleases 



ENOONUCLEASE 



RECOGNITION SEQUENCE 



CLEAVAGE 



5f|^ V 'NTRON SITE 



CO 



a. 
m 



o 



o 



I-Sce I 
(Saccharomyces mitochondria) 

I-Sce lY 
(Saccharomyces mitochondria) 

I-Sce H 
(Saccharomyces mitochondria) 

I-Ceu I 
(Chlomydomonas chioroplast) 

I-Ppo I 
(Physarum nucleus) 

I-Sce m 
(Sacchoromyces mitochondria) 

I-Cre I 
(Chlomydomonas chioroplast) 



CGC|TAGGGAiiAlCAG6GT7miA TAGC 
G CG AT C C CIT AT T G T C C CAT T A T ATCG 



ttctcatgauSIgctctaatccatgg 

A AG AG T A CjTAATC G A G AT T A G G T A C C 

CtTTTGGT^AIiLdAGAAGTAiTAT ATTT 
GAAACCAGITAGGTCTTCATATATAAA 



TAACGGT'CCIA^GGTAGCGAAATTCA 
ATTGCCAGlGATTCCATCGCTTT AAGT 

2- 



TGACTCT CXImIgGTAGCCIAAATGCC 
ACTGAGAGIAAT TCCATCGGTTT ACGG 



GGAGGTT TTGGTAA CT ATTT at TACC 
CCT CCAAAACCATT GATAAATA ATGG 

V 

GGGTT CAAAACGT CGTGAGACAGTTT 
CCCAAGTTTTGCAGCACTCTGTCAAA 



UI 



Endo. See I(RF3) 
(Saccharomyces mitocnondrio) 
(Non intronic) 

HO 

(Sacchoromyces nucleus) 
(Non intronic) 



GAT GOT GTMI^AT AGGCTTGGTTAT 
CIA CGACAITCCGTAT CCGAACCAATA 



CtTT TCCGCAA^LAIGT ATAATTTTATAA 
GAA AGGCGITTGTCATATTAAAA TATT 



I-Csm I 



(Chlamvdomonas mitochondria) 
(Putative endonuclease) 

I-Pan I 
(Podosporo mitochondria) 
(Putative endonuclease) 



ACCAT GGGGTCAAAT GTCTTTCTGGG 
TGGTACCCCAGTTT ACAGAAAGACCC 

V 

GTGCCTGAATGAT AT TTATTACCTTT 
CACGGACTTACTAT AAATAATGGAAA 



ZD 



s 

i 



(Bacteriophage T4) 



I Tev I 



3t 



I Tev It 



I Tev in 



„GCTCAGTA6ATGTTTTCTT GGGT C TACCGTTTAAT 
GCGAGTCAT CTACAAAAGAACCCA GATGGCAAATTA 



CAAGCTTATGAGT ATGAAGT G AACACGIJTATT 
GTTCGAATACTCATACTTCACTTGTG[CAATAA 

GOT attcgttttIlat GTAT CTTTTGC^GTAGCTTTAA 
CGATAAGCAAAATTnA CAT A GAAA AC G C ACAT CGA A AT T 



FIG. 6 



EXPRESSION VECTORS 

EcoRI EcoRI 




FIG. 7 




I EcoR I 2642 
Pst I 2848 



FIG. 8 




FIG. 9 



Sou3A I 
Mbo I 
Dpn n 
ScrF I 
Nc? I 
Msp I 
Hpa n 
Dsa V 
BstK I 
Xma I 
Sma I 
ScrF I 
Nci I 
Dsa V 

RsG I Dpn I 
Nla IV NIq IV 
Sac I Csp6 I Alw I 
HgiA I BstK I Taq I 

Ed 136 I BsaJ I Sal I 

Sph I Rma I Kpn I BstY I Hinc U 
NspC I Bspl286 I Ben I Sfe I 
EcoR I Xba I Bon I BannH I Acc I I-Sce I 

Apo I Nio lEC Ban 11 Ben I Sfe I Rma I 

Taq I Nsp7524 I Alu I Ava I Pst I Hga I 
AIu I Nsp I Bfa I Asp718 Aiw I BspW I Bfa I FcoR V 

III II MM II II II I i 11 I II I 

CCAAGCTCGAATTCGCATGCTCTAGAGCTCGGTACCCGGGATCCTGCAGTCGACGqiAGOGAIAACAGGGTAAm 2320 
GGnCGAGCnAAGCGTACGAGATCTCGAGCCATGGGCCCTAGGACGTCAGCTGCG MCCCTATTGTCCCATTA rCTCTA 

III- II -M II -M Mil 1 I M 1 \T~' : — ldoiwim 

2244 2255 2262 2271 2279 2286 2296 2318 

2247 2255 2256 2275 2284 2292 

2249 2256 2265 2275 2284 2296 

2249 2261 2271 2279 2289 2297 

2255 2265 2276 2284 

2255 2262 2271 2279 2289 
2265 2275 2289 

2265 2275 2290 

2265 2272 2280 
2271 2279 
2272 2280 
2275 
2275 
2275 
2275 
2275 
2275 
2276 
2276 
2276 
2276 
2276 
2276 

2280 
2280 
2280 



FIG. lOA 



ScrF I 

Nci I 

Msp I 

Hpo U 

Dsa V 

BstK I 

Ben I 
Xma I 
Sma I 
ScrF I 
Nci I 
Dsa V 
BstK I 
BsaJ I 
Sau3A I 

Mbo I Sac I 

Dpn n HgiA I 

Alu I Sfe I Dpn I Rsa I 

Hmd m Sfc I Pie I Nia IV Csp6 I 
Dde I Pst I Hlnf I BslY I NIo IV EcoR I 

Sau3o I Sse8337 I BomH I Kpn I Apo I 

Mbo I NIa HL Taq I Alw I Ban I Alu I BceF I 

Dpn II Sph I Sal I Rma I Ben I Eel! 36 I Hoe III 

Dpn I NspC I Hinc II Mnl I Ave I Bsp1286 I Gdi IE 

BstY I Nsp7524 I Ace I Bfo I BsaJ I Bon II Eae I 

Bd II Nsp I BspM I Xba I Alw I Asp718 Taq I Bsr I 

if ! II II III II I 11 I M I If II Nil I 111 

ATCAGATCTAAGCnGCATGCCTGCAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCACTGGCCGTCGTTT 2400 

TAGTCTAGAnCGAACGTACGGACGTCCAGCTGAGATCTCCTAGGGGCCCATGGCTCGAGCTTAAGTGACCGGCAGCAM 
II III II -III I I • I I [ IN I I I N 11 i • I I 1 M ^^'^ 

2324 2335 2344 2354 2361 2369 2379 2387 

2324 2336 2348 2355 2364 2375 2389 

2325 2336 2348 2358 2365 2375 2389 

2325 2336 2348 2355 2355 2375 2390 

2325 2337 2349 2360 2359 2376 2391 

2325 2341 2360 2369 2381 

2328 2342 2351 2350 2369 2381 

2330 2342 2351 2350 2370 

2331 2342 2351 2370 

2351 2375 

2361 2375 

2361 

2365 
2365 
2355 
2365 
2355 
2355 
2365 

2366 

2366 

2366 

2366 

2366 

236e 

Mae n 2366 
ScrF n 
EcoR H 
Dsa 2 

BstN I Fok I Alu I 

Mce HE BstK I Fnu4H I Pvu U 

Mae n Bsr I BsaJ I Mse I Bbv I NspB II 

ill II I I II II 

TACAACGTCGTGACTGGGAAAACCCTGGCGmCCCAACTTAATCGCCTTGC^ 2480 

ATGnGCAGCACTGACCCTTTTGGGACCGCAATGGGTTGAATTAGCGGAACGTCGTGTAGGGGGA^AGCGGTCGACCGCA 
111 'I! I I M 1 • Ml 

2405 2413 2423 2440 2451 2471 

2410 2424 2451 2471 

2424 2457 2472 

2I24 FIG. lOB 

2430 '"^ 




Construction: pGP 704 from De Lorenzo, with transposase 
gene and insertion of the linker[I— Scel] in NotI unique site 



FIG. I I 



URA 3 




Construction: pD 123. from J.D. Boeke 
with insertion of a Iinker[I-ScoI-NotIJ in BamHI 



FIG. 12 



Sma I 8806 

Nor I 8778 
Kpn I 8810 
Sac I 8743 
EcoR I 831 1 
Nde I 8263 



Soc I 7243 



EcoR V 6418 



Xba I 9009 

Sac I 9033 

Sea I 436 




Sma I 5021- 



Kpn I 2205 

Xho I 2220 
Sal I 2226 

HinD HL 2241 
EcoR V 2402 
Nhe I 2829 

EcoR V 2940 
EcoR V 3015 
Xba I 3096 
Sac I 3212 
Nar I 3247 
Sma I 3275 
Kpn I 3279 



FIG. I3A 
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FIG. I5D 



I-Sce I MAP 



50 


50 


40 


40 


90 


60 


150 


120 


70 


60 


70 


150 


180 



77/ 



\ 




pEKG 100 
pEKG 019 



pEKG 098 



pEKG 090 
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FIG. I5E 
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PHYSICAL MAP 
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REPAIR USING ECTOPIC DUPUCATION 
DELEnON AND GENE CONVERSION 



FIG. ISA 




I-Sce I 



OR 



REPAIR USING HOMOLOGOUS SEQUENCE ON PLASMID 
DELOION OF URA3 AND REPLACEMENT 
PLASMID INTEGRATION 



FIG. I9B 



